Sequence Listing was accepted. 

If you need help call the Patent Electronic Business Center at (866) 
217-9197 (toll free) . 
Reviewer: Anne Corrigan 

Timestamp: [year=2007; month=12; day=14; hr=9; min=59; sec=44; ms=179; ] 



Validated By CRFValidator v 1.0.3 
Application No: 10031496 Version No: 



7.0 



Input Set: 



Output Set: 



Started: 2007-11-26 18:43:03.925 
Finished: 2007-11-26 18:43:08.778 
Elapsed: 0 hr(s) 0 min(s) 4 sec(s) 853 ms 
Total Warnings : 9 9 

Total Errors: 0 
No. of SeqIDs Defined: 99 
Actual SeqID Count: 99 

Error code Error Description 

W 213 Artificial or Unknown found in <213> in SEQ ID (1) 

W 402 Undefined organism found in <213> in SEQ ID (2) 

W 213 Artificial or Unknown found in <213> in SEQ ID (3) 

W 402 Undefined organism found in <213> in SEQ ID (4) 

W 402 Undefined organism found in <213> in SEQ ID (5) 

W 402 Undefined organism found in <213> in SEQ ID (6) 

W 402 Undefined organism found in <213> in SEQ ID (7) 

W 402 Undefined organism found in <213> in SEQ ID (8) 

W 213 Artificial or Unknown found in <213> in SEQ ID (9) 

W 213 Artificial or Unknown found in <213> in SEQ ID (10) 

W 213 Artificial or Unknown found in <213> in SEQ ID (11) 

W 213 Artificial or Unknown found in <213> in SEQ ID (12) 

W 213 Artificial or Unknown found in <213> in SEQ ID (13) 

W 213 Artificial or Unknown found in <213> in SEQ ID (14) 

W 213 Artificial or Unknown found in <213> in SEQ ID (15) 

W 213 Artificial or Unknown found in <213> in SEQ ID (16) 

W 213 Artificial or Unknown found in <213> in SEQ ID (17) 

W 213 Artificial or Unknown found in <213> in SEQ ID (18) 

W 213 Artificial or Unknown found in <213> in SEQ ID (19) 

W 213 Artificial or Unknown found in <213> in SEQ ID (20) 



Input Set: 



Output Set: 



Started: 2007-11-26 18:43:03.925 

Finished: 2007-11-26 18:43:08.778 

Elapsed: O hr(s) 0 min(s) 4 sec(s) 853 ms 

Total Warnings: 99 

Total Errors: 0 

No. of SeqIDs Defined: 99 

Actual SeqID Count: 99 

Error code Error Description 

W 213 Artificial or Unknown found in <213> in SEQ ID (21) 

W 213 Artificial or Unknown found in <213> in SEQ ID (22) 

W 213 Artificial or Unknown found in <213> in SEQ ID (23) 

W 213 Artificial or Unknown found in <213> in SEQ ID (24) 

W 213 Artificial or Unknown found in <213> in SEQ ID (25) 

W 213 Artificial or Unknown found in <213> in SEQ ID (26) 

This error has occured more than 20 times, will not be displayed 

W 402 Undefined organism found in <213> in SEQ ID (98) 

W 402 Undefined organism found in <213> in SEQ ID (99) 



SEQUENCE LISTING 



<110> National Renewable Energy Laboratory (NREL) 

<120> Cellobiohydrolase I Gene and Improved Variants 

<130> NREL 99-45 

<140> 10031496 

<141> 2002-01-14 



<160> 99 

<170> Patentin version 3.4 

<210> 1 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> Nucleotide encoding linker 

<400> 1 

cctcccggcg gaaacccgcc tggcaccacc accacccgcc gccca 4 5 



<210> 2 
<211> 15 
<212> PRT 

<213> Trichoderma reesei 
<400> 2 

Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr Thr Arg Arg Pro 
15 10 15 



<210> 3 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Nucleotide encoding linker 

<400> 3 

ggcggaaacc cgcctggcac cacc 2 4 



<210> 4 

<211> 1551 

<212> DNA 

<213> Trichoderma reesei 



<400> 



4 



atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc tcagtcggcc 60 

tgcactctcc aatcggagac tcacccgcct ctgacatggc agaaatgctc gtctggtggc 120 

acgtgcactc aacagacagg ctccgtggtc atcgacgcca actggcgctg gactcacgct 18 0 

acgaacagca gcacgaactg ctacgatggc aacacttgga gctcgaccct atgtcctgac 24 0 

aacgagacct gcgcgaagaa ctgctgtctg gacggtgccg cctacgcgtc cacgtacgga 30 0 

gttaccacga gcggtaacag cctctccatt ggctttgtca cccagtctgc gcagaagaac 360 

gttggcgctc gcctttacct tatggcgagc gacacgacct accaggaatt caccctgctt 420 

ggcaacgagt tctctttcga tgttgatgtt tcgcagctgc cgtgcggctt gaacggagct 48 0 

ctctacttcg tgtccatgga cgcggatggt ggcgtgagca agtatcccac caacaccgct 54 0 

ggcgccaagt acggcacggg gtactgtgac agccagtgtc cccgcgatct gaagttcatc 60 0 

aatggccagg ccaacgttga gggctgggag ccgtcatcca acaacgcgaa cacgggcatt 660 

ggaggacacg gaagctgctg ctctgagatg gatatctggg aggccaactc catctccgag 720 

gctcttaccc cccacccttg cacgactgtc ggccaggaga tctgcgaggg tgatgggtgc 78 0 

ggcggaactt actccgataa cagatatggc ggcacttgcg atcccgatgg ctgcgactgg 84 0 

aacccatacc gcctgggcaa caccagcttc tacggccctg gctcaagctt taccctcgat 90 0 

accaccaaga aattgaccgt tgtcacccag ttcgagacgt cgggtgccat caaccgatac 960 

tatgtccaga atggcgtcac tttccagcag cccaacgccg agcttggtag ttactctggc 1020 

aacgagctca acgatgatta ctgcacagct gaggaggcag aattcggcgg atcctctttc 108 0 

tcagacaagg gcggcctgac tcagttcaag aaggctacct ctggcggcat ggttctggtc 1140 

atgagtctgt gggatgatta ctacgccaac atgctgtggc tggactccac ctacccgaca 120 0 

aacgagacct cctccacacc cggtgccgtg cgcggaagct gctccaccag ctccggtgtc 1260 

cctgctcagg tcgaatctca gtctcccaac gccaaggtca ccttctccaa catcaagttc 1320 

ggacccattg gcagcaccgg caaccctagc ggcggcaacc ctcccggcgg aaacccgcct 138 0 

ggcaccacca ccacccgccg cccagccact accactggaa gctctcccgg acctacccag 1440 

tctcactacg gccagtgcgg cggtattggc tacagcggcc ccacggtctg cgccagcggc 150 0 

acaacttgcc aggtcctgaa cccttactac tctcagtgcc tgtaaagctc c 1551 

<210> 5 
<211> 514 
<212> PRT 

<213> Tr ichoderma reesei 



<400> 5 



Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
15 10 15 



Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
20 25 30 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

35 40 45 



Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
50 55 60 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
65 70 75 80 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
85 90 95 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 
100 105 110 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 
115 120 125 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
130 135 140 



Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
145 150 155 160 



Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
165 170 175 



Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 
180 185 190 



Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 
195 200 205 



Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
210 215 220 



Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
225 230 235 240 



Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 
245 250 255 



Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 
260 265 270 



Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 
275 280 285 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
290 295 300 



Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr 
305 310 315 320 



Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
325 330 335 



Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 
340 345 350 



Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 
355 360 365 



Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 

370 375 380 



Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
385 390 395 400 



Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
405 410 415 



Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 
420 425 430 



Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 
435 440 445 



Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
450 455 460 



Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
465 470 475 480 



Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 
485 490 495 



Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
500 505 510 



Cys Leu 



<210> 6 
<211> 514 
<212> PRT 

<213> Trichoderma reesei CBH1-N45A 
<400> 6 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
15 10 15 



Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
20 25 30 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 
35 40 45 



Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Ala Ser Ser 
50 55 60 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
65 70 75 80 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
85 90 95 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 
100 105 110 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 



115 



120 



125 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
130 135 140 



Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
145 150 155 160 



Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
165 170 175 



Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 
180 185 190 



Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 
195 200 205 



Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
210 215 220 



Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
225 230 235 240 



Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 
245 250 255 



Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 
260 265 270 



Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 
275 280 285 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
290 295 300 



Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr 
305 310 315 320 



Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 

325 330 335 



Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 
340 345 350 



Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 
355 360 365 



Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
370 375 380 



Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
385 390 395 400 



Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
405 410 415 



Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 
420 425 430 



Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 
435 440 445 



Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
450 455 460 



Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
465 470 475 480 



Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 
485 490 495 



Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
500 505 510 



Cys Leu 



<210> 7 

<211> 514 

<212> PRT 

<213> Trichoderma reesei CBH1-N270A 

<400> 7 



Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
15 10 15 



Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
20 25 30 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 
35 40 45 



Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
50 55 60 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
65 70 75 80 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
85 90 95 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 
100 105 110 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 
115 120 125 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 

130 135 140 



Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
145 150 155 160 



Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
165 170 175 



Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 
180 185 190 



Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 
195 200 205 



Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
210 215 220 



Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
225 230 235 240 



Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 



245 250 255 



Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 
260 265 270 



Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Ala Thr 
275 280 285 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
290 295 300 



Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr 
305 310 315 320 



Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
325 330 335 



Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 
340 345 350 



Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 
355 360 365 



Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
370 375 380 



Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
385 390 395 400 



Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
405 410 415 



Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 
420 425 430 



Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 
435 440 445 



Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
450 455 460 



Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
465 470 475 480 



Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 
485 490 495 



Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
500 505 510 



Cys Leu 



<210> 8 
<211> 514 
<212> PRT 

<213> Trichoderma reesei CBH1-N384A 
<400> 8 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
15 10 15 



Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
20 25 30 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 
35 40 45 



Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
50 55 60 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
65 70 75 80 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
85 90 95 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 
100 105 110 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 
115 120 125 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
130 135 140 



Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
145 150 



